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1*.  ABSTRACT 

111.  *P«N*//f»INO  MlLlTiPT  ACTIVITT 

1  Deputy  Chief  of  Staff  for  Personnel, 

|  Washington,  D.C. 

•  US  Continental  Array  Cmd,  Ft.  Monroe,  Va. 

Army  personnel  managers  have  a  continuing  need  to  select,  classify,  and  assign 
to  training  and  jobs  large  numbers  of  men  who  enter  the  service..  The  Array  Classifi¬ 
cation  Battery  (ACB)  is  an  Integral  part  of  the  assignment  process;  the  Array  Research 
Institute's  Training  Technology  and  Classification  Work  Unit  Area  has  an  ongoing 
research  program  to  keep  the  classification  battery  effective  and  up  to  date.  As  part 
of  the  overall  effort,  a  new  ACB  and  aptitude  area  system  have  been  developed  which 
result  in  ar,  improved  system  of  classification  for  training  and  jobs.  Technical 
Research  Report  1177  describes  these  new  tests,  and  Technical  Research  Note  239  eval¬ 
uates  them  Si  predictors  of  success.^. The  present  publication  addresses  the  valt;e  of 
selection  and  classification  testing  programs  in  relation  to  job  training  success  and 
the  suitability  of  the  tests  for  subgroups  of  the  manpower  available  to  the  Army. 

^  The  various  testing  programs  in  the  Array's  enlisted  personnel  system  are  de¬ 
scribed;  the  relationships  between  testing  program,  training  content  sad  method,  and 
utilization  on  the  job  are  probed;  and  the  methodology  is  explained  by  which  the 
validity  of  tie  tests  is  established.  Analysis  of  measures  of  performance  in  job 
training  programs  and  ratings  of  job  performance  reveals  that  training  performance  is 
more  satisfactory  than  job  ratings  for  evaluating  teat  effectiveness.  How  well  tests 
predict  performance  in  training  programs  and  the  relation  between  test  scores  and 
other  indexes  of  success  are  examined  separately  for  Negroes  and  whites. 
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13.  ABSTRACT  continued 


^'"Seliecticn  and  classification  tests  are  effective  in  identifying 
potential  'faiivr.es^  in  Army  training  program  and  for  assigning  men  to 
jobs  where  their  potential  is  best  used  and  where  they  can  best  serve 
the  Army.  Aptitude  test  scores  nre  useful  indicators  of  the  proficiency 
and  grade  a  man  can  attain,  of  the  time  required  to.  .bring  a  trainee  to  a 
minimum  level  of  performance,  and  in  identifying  general-  categories — men 
eligible  for  officer  training,  for  example.  The  tests  are  not  only  re¬ 
lated  to  rate  of  promotion  in  the  Army  but  to  civilian  earnings  after 
separation  from  service,  for  both  Negroes  and  whites.  The  testa  are. 
equally  effective  in  predicting  .performance  in  job  training  for  Negroes 
and  whites.  While  Negroes  make  lower  average  scores,  their  training  per¬ 
formance  is  correspondingly  lower,  comparable  to  that  of  whites  with  the 
name  level  of  test  scores. 

,  I  A  general  criticism  of  tests,  particularly  military  tests,  has 
arisen  in  recent  years.  Analysis  supports  the  usefulness  of  tests  in  ihe 
Army's  personnel  systems  despite  changing  concepts  in  the  Army's  mission, 
•particularly  as  the  tests  themselves  are  continually  reexamined  to  meet 
changed  requirements . 
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FOREWORD 


ARI's  Training  Technology  &  Classification  Work  Unit  Area  applies  psychological 
measurement  methods  to  enable  the  Army  to  make  best  use  of  ths  skilii  anti  aptitudes  of  its 
enlisted  personnel  through  increasingly  accurate  and  differentiated  measures  of  m^r.-itfeisl 
potential.  Research  is  conducted  to  maintain  and  improve  the  efieetiveness  a-.  Army 
Clarification  Battery  and  related  techniques  and  tc  assess  the  impact  of  conditions  that  may 
interact  with  the  classification  test3  and  thus  effect  the  basis  for  utilization  of  th<*  oniisted 
input-changes  in  training  programs  and  job  content  and  environment,  for  example. 

sne  present  Research  Report  addresses  die  value  of  selection  and  classification  testing 
programs  in  relation  to  job  warning  success  and  the  suitability  of  the  tests  for  subgroups  of  the 
manpower  available  to  the  Army,  me  entire  research  work  unit  ares  is  responsive  to  specie! 
requirements  of  die  Deputy  Chief  of  Staff  for  Personnel  and  the  U.  a  Continental  Army 
Command,  as  wet!  as  to  objectives  of  Army  RDTE  Project  2Q0821 G6A722,  Selection  and 
Behavioral  Evaluation,  FY  1973  Work  Program. 


<T.  6,  UHLANEfT 
Technical  Director 
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BRIEF 

— - -  inini  ■ 


Requirement: 

To  provide  a  general  evaluation  of  the  Army's  selection  and  classification  testing  programs  in 
terms  of  their  contribution  to  effective  utilization  of  the  manpower  available  to  the  Army. 


Procedure: 

f  he  various  testing  programs  in  the  Army's  enlisted  personnel  system  are  described,  and  the 
relationships  between  testing  program,  training  content  and  method,  and  utilization  on  the  job  are 
probed.  A  brief  explanation  is  s;ven  of  the  methodology  by  which  the  effectiveness-that  is,  the 
vaiidity-of  the  tests  is  established.  Analysis  of  measures  of  performance  in  job  training  programs 
and  ratings  of  performance  on  trie  job  reveals  th3t  training  performance  is  more  satisfactory  than 
job  ratings  for  evaluating  the  effectiveness  of  selection  and  classification  tests.  How  well  tests 
predict  performance  in  job  training  programs  and  the  relationship  between  test  scores  and  other 
indexes  of  success  are  examined  separate!-/  for  Negroes  and  v/hites. 


Findings: 

Selection  and  classification  tests  through  twenty  years  of  research  and  experience  have 
demonstrated  their  effectiveness  in  identifying  potential  failures  in  Army  training  programs  and 
for  getting  men  into  jobs  where  the:r  potential  is  best  utilized  and  they  can  best  serve  the  Army. 
Aptitude  test  scores  are  usef  ji  indicators  of  the  level  of  proficiency  and  grade  a  man  can  attain  and 
of  the  time  required  to  bring  a  trainee  :o  a  minimum  level  nf  performance.  They  are  also  useful  in 
identifying  general  categories  input-men  eligible  for  training  to  become  officers,  for  example. 
The  tests  are  related  to  rate  of  oromotion  in  the  Army  ana  te  civilian  earnings  after  separation 
from  service.  Much  the  same  order  of  relationship  nolds  for  N'  gross  and  whites. 

The  tests  have  been  four  A  to  be  equally  effective  as  predictors  of  performance  in  job  teaming 
for  Negroes  and  whites.  While  Negroes  score  lower  on  these,  their  training  performance  is 
correspondingly  lower.  Their  training  f>s;form?rsce  is  comparable  to  that  of  whites  with  the  same 
level  of  test  scores. 


Utilization  of  Findings: 

The  present  report  analyzes  the  genera!  criticism  af  tests,  and  of  military  tests  in  particular, 
that  have  arisen  in  recent  years.  The  analysis  supports  the  usefulness  of  tests  in  the  Army’* 
personnel  systems  despite  changing  concepts  *n  the  Army’s  mission.  Long-standing  test  practices  in 
enlisted  and  officer  programs  are  continually  being  reexamined  with  a  view  to  adjusting  suen 
programs  m  the  light  of  changing  requirements. 
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Figure  2.  Relationship  among  job  level,  and  job  training 
and  aptitude  requirements  in  the  Army. 
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OVERVIEW 

Each  year,  the  U.  S.  Army  must  obtain  thousands  of  new  soldiers  and 
prepare  them  to  become  productive  workers  in  their  units.  Accomplishment 
of  assigned  missions  in  the  Army  is  critical  to  the  nation  and  often  a 
matter  of  life  and  death  to  the  individual  soldier.  Selection  and  classi¬ 
fication  testing  has  been  used  by  Army  personnel  managers  for  many  years 
to  enhance  the  Arne’s  capability  to  accomplish  its  mission.  The  effec¬ 
tiveness  of  selection  and  classification  testing  in  the  Army  is  examined 
in  the  present  report.  A  description  of  the  tests  is  given,  together 
with  an  explanation  of  how  their  effectiveness  is  determined.  Their 
effectiveness  in  predicting  job  training  and  other  indexes  of  success  is 
examined  separately  for  Negroes  and  whites.  Finally,  the  interrelation¬ 
ship  of  tests,  training,  and  utilization  on  the  job  is  considered. 

Army  personnel  managers  need  efficient  and  effective  means  of  identi 
fying  men  who  can  j.ofit  from  jcb  training  and  contribute  to  accomplish¬ 
ment  of  unit  missions;  men  who  qualify  for  training  must  then  be  classi¬ 
fied  to  determine  the  types  of  job  in  which  they  can  profitably  serve 
the  Army. 

Historical  solutions  to  problems  of  selection  and  classification, 
both  military  and  civilian,  have  relied  on  personal  interviews,  informa¬ 
tion  about  prior  education  and  work  experience,  trial-and-error  decisions, 
even  random  assignment.  All  these  techniques  are  costly  in  terms  of  tire 
required  to  arrive  at  a  reasonable  decision  or  in  terms  of  errors  in 
assignment,  or  both.  The  modem  solution  used  by  the  Army  and  by  the 
other  armed  services  m  *-his  country  is  selection  and  classification 
testing. 

The  first  decision  the  Army  makes  about  an  individual  is  whether  1  »• 
should  be  accessioned — that  is,  inducted  or  accepted  for  military  service 
To  be  accessioned  a  person  must  be  mentally,  physically,  and  morally 
qualified.  Mental  qualification  is  determined  primarily  on  th-c  basis  >f 
the  Armed  Forces  Qualification  Test  (AFQT).  For  marginal  men,  supplemen¬ 
tary  evidence  of  trainability  is  required.  Marginal  men  are  those  «ho 
are  above  the  minimum  level — above  the  bottom  10  percent  of  the  popula¬ 
tion  of  young  men  aged  l8-25--but  below  the  top  70  percent.  Most  of  the 
tup  70  percent  can  succeed  in  some  Army  job,  and  they  are  accepted  with 
no  further  mental  qualification.  The  bottom  10  percent  includes  a  group 
in  which  a  preponderant  percentage  would  be  failures  in  the  Array.  Ken 
in  this  category  are  therefore  not  accepted  for  service.  Ken  in  the 
marginal  group  may  succeed  in  the  Army  under  favorable  conditions.  To 
identify  marginal  men  who  are, likely  to  meet  training  and  ^ob  require¬ 
ments,  Che  Army  has  usually  required  additional  evidence  of  trainability. 
This  evidence  is  obtained  from  aptitude  scores  on  the  Army  Qualification 
Battery  (AQB). 


\ 
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For  each  man  accessioned  into  Army  enlisted  service,  the  second 
decision  is  determination  of  the  job  area  for  which  he  is  best  qualified 
The  Army  Classification  Battery  (ACB)  is  used  to  classify  men  according 
to  their  aptitudes  or  potential  for  success  in  different  Army  jobs. 


TESTS  USED  IN  SELECTION  AND  CLASSIFICATION 

The  AFQT  is  a  measure  of  general  tra inability  composed  of  100 
questions ,  or  items,  equally  divided  among  word  knowledge,  arithmetic 
reasoning,  spatial  perception,  and  knowledge  of  tool  functions.  Items 
.similar  to  items  in  the  AVQT  are  widely  used  in  the  civiiicn  sector  to 
measure  potential  for  success  in  formal  academic  training  and  in  selec¬ 
tion  for  jobs  of  all  types  at  all  levels.  The  AFQT  yields  a  single 
global  assessment  of  potential  to  perform  successfully  in  the  variety 
of  Army  job  training  programs.  The  AFQT  is  described  in  Appendix  A. 

The  Army  Qualification  Battery  (AQB)  is  used  as  a  supplementary 
screen  to  determine  mental  qualification  by  measuring  the  individual's 
strengths  and  weaknesses  for  different  kinds  of  jobs.  The  current  A.QB 
assesses  the  person's  potential  in  seven  aptitude  or  job  areas:  Infan¬ 
try.  Armor-Artillery-Engineering,  Electronic  Maintenance,  Mechanical 
Maintenance,  General  Maintenance,  Clerical-Administrative,  and  General 
Technical.  A  Radio  Code  score  can  also  be  obtained  for  men  who  apply 
for  a  job  in  the  Radio  Code  area.  The  Army  Classification  Battery  (ACB) 
given  to  new  recruita  at  Reception  Stations,  is  similar  to  the  AQB, 
except  that  the  component  tests  are  longer.  Eight  aptitude  area  scores 
are  obtained.  These  are  the  same  as  those  obtained  from  the  AQB,  plus 
a  Radio  Code  Aptitude  score. 

Content  of  most  of  the  tests  in  the  AQB  and  the  ACB  is  similar  to 
that  of  differential  aptitude  b?tt,erie^>  used  in  the  civilian  sector  for 
vocational  guidance  or  for  job  placement,  or  both.  The  tests  used  by 
the  Army  have  been  developed  spscifically  to  meet  the  Army's  needs,  and 
too,  the  General  Information  Test  and  the  Classification  Inventory, 
have  subject  matter  unique  in  the  Army,  The  tests  making  up  the  AQB  and 
tne  ACB  are  described  in  Appendix  h. 


DEVELOPMENT  AND  EVALUATION  OF  THE  TESTS  THROUGH  RESEARCH 

The  Army  Research  Institute  for  the  Behavioral  and  Social  Sciences 
(ARl)  in  the  Office  of  the  Chief  of  Research  and  Development  has  a 
continuing  research  program  to  develop  and  improve  selection  and  classi¬ 
fication  tests.  The  research  steps  typically  followed  by  ARl  in  develop 
ing  s  test  and  in  determining  its  predictive  accuracy — or  validating 
it — are  as  follows : 

IT  ' 

The  Army  Selection  and  classification  tests  described  in  this  report 
were  in  use  prior  to  1  May  1975,  cm  which  date  a  Army  Classification 
Battery  was  introduced.  The  principles  and  conclusions  presented  in 
this  report  are  not  affected  by  that  change. 


1.  Definition  of  the  problem,  or  the  purpose  for  which  the  test 
or  battery  of  tests  is  required, 

2.  Construction  of  the  test  or  tests  d<..*gned  to  measure  aptitudes, 
interests,  oc  ether  personal  characteristics  that  predict  performance  in 
the  Army. 

3.  Deciding  how  the  soldier  performance  is  to  be  measured.  Perfor¬ 
mance  fiisy  be  in  job  training  courses  or  on  the  job,  and  constitutes  the 
"criterion"  the  tests  should  predict. 

4.  Measurement  of  soldier  performance. 

5.  Analysis  of  the  items  or  tests  to  select  those  that  contribute 
to  effective  prediction  of  the  criterion,  and  to  find  effective  combina¬ 
tions  of  predictors-  An  effective  test  or  predictor  is  one  that  shows  a 
close  correspondence  to  the  criterion  measure  of  performance. 


Measurement  of  Performance 

The  measurement  of  soldier  performance  (Step  4)  is  a  critical  step 
in  research  to  evaluate  the  effectiveness  of  a  test.  Evaluations  of  how 
well  a  man  is  performing  on  the  job  are  not  found  ready-made:  ju3t  as 
tests  have  to  be  developed  to  predict  performance ,  so  procedures  must  be 
developed  to  assess  how  well  a  man  is  performing  in  his  assignment. 

Ideally,  assessment  of  performance  o;,  all  relevant  training  and  job 
tasks  would  be  completely  accurate.  However,  since  job  duties  vary  from 
location  tc  location,  from  supervisor  co  supervisor,  and  even  from  time 
to  time,  no  universal  cr  absolute  measure  of  job  performance  can  be 
obtained.  The  best  that  can  accomplished  <s  to  cotain  an  estimate  of 
a  man's  performance  under  a  range  of  job  conditions.  The  estimate  i3 
based  on  a  sampling  of  what  a  man  does  on  a  job,  not  on  everything  that 
the  job  is  likely  to  demand  of  him.  A  critical  research  problem  is  to 
find  satisfactory  w-:ys  of  sampling  the  job  demands  to  build  an  adecuate 
criterion  of  performance  against  which  to  evaluate  tie  tests. 

Performance  testing,  that  is,  assessing  how  well  a  man  does  on  a 
specified  set  of  tasks,  has  appeal  as  a  criterion  of  competence  on  the 
job.  However,  performance  tests,  no  matter  how  carefully  and  expertly 
devised,  can  include  only  part  of  what  men  do  and  are  expected  to  do  on 
the  job.  In  addition,  performance  testing  typically  covers  or.ly  the 
minimum  job  requirements,  and  thus  does  not  asses?  the  critical  compo¬ 
nent  of  how  well  the  man  can  apply  his  skills  and  knowledge  to  new 
situations.  The  criterion  should  be  an  assessment  of  typical  performance 
over  an  extended  period  of  time,  and  performance  testing  ordinarily  is 
conducted  in  a  brief  period,  such  as  one  or  two  days.  Thus,  a  man  may  be 


highly  motivated  to  excel  on  the  limited  performance  test,  bat  his 
typical  performance  may  be  at  a  lower  level.  The  converse  may  also  be 
true:  A  man  may  get  excessively  nervous  over  the  test  and  score  below 
his  typical  level  of  performance. 

To  obtain  measures  of  typical  performance  in  real  work  environments, 
researchers  have  frequently  relied  on  ratings  of  performance  wade  by 
supervisors  or  fellow  workers,  or  both.  One  assumption  underlying  the 
use  of  ratings  is  that  a  representative  sampling  of  duty  positions, 
workers,  and  supervisors  covers  most  of  the  job  demands,  and  chus  on  the 
whole  the  relevant  aspects  of  the  job  are  included  in  the  criterion 
assessment.  Another  assumption  is  that  the  rater,  whether  supervisor  or 
fellow  worker,  considers  a  man's  performance  in  light  of  the  responsi¬ 
bilities  that  are  part  of  the  job.  The  rating  includes  not  only  how  well 
the  man  can  perform  the  tasks  but  also  how  well  he  actually  meets  the 
demands*  in  contrast  to  performance  tests,  which  assess  only  competence 
or  skill  in  accomplishing  specified  tasks,  ratings  reflect  both  compe¬ 
tence  and  willingness  to  get  the  job  done.  Ratings  suffer  from  a  short¬ 
coming  in  that  they  are  subject  to  great  variability  because  there  are 
no  objective  standards  by  which  a  worker's  performance  can  be  assessed. 
Subjective  impressions  play  a  large  part  in  ratings,  and  thus  ratings 
do  not  fully  satisfy  the  scientific  requirements  of  objectivity  and 
standard  conditions. 

The  criterion  that  is  scientifically  adequate  and  usually  preferred 
by  redearchers  is  evaluations  of  performance  in  job  training  courses. 

Such  evaluations  are  based  on  a  better  sampling  of  job  demands  than  are 
performance  tests,  and  are  more  objective  than  ratings.  During  job 
training,  a  man's  performance  is  observed  over  an  extended  period  of  time 
in  fairly  standard  situations.  Since  many  men  are  being  trained,  each 
man's  performance  can  be  compared  to  that  of  a  large  number  of  others 
who  are  also  learning  the  job.  In  contrast,  the  typical  job  environment 
has  a  limited  number  of  workers  at  the  same  level,  and  the  supervisor 
must  base  his  evaluations  on  a  limited  sampling  of  men.  In  job  training, 
objective  standards  of  performance  have  been  developed,  and  all  trainees 
aite  evaluated  against  these  standards.  The  assumptions  underlying  the 
use  of  training  grades  as  a  criterion  are  1)  that  the  curriculum  builders 
for  Army  school  courses ,  because  of  their  extensive  experience  as  workers 
and  trainers,  have  incorporated  the  most  important  job  skills  and  knowl¬ 
edge  into  the  training  programs,  and  2)  that  all  trainees  go  through 
much  the  same  instructional  materials.  Grades  obtained  in  training 
courses  provide  assessments  of  the  men's  ability  to  accomplish  a  uniform 
set  of  job  tasks,  and  also  to  a  degree  their  capacity  to  generalize  to 
new  situations.  Granted,  training  grades  are  one  step  removed  from  per¬ 
formance  on  real-life  job  defies.  However,  because  they  satisfy  scien¬ 
tific  requirements  for  extensive  sampling  of  tasks  plus  providing  objec¬ 
tive  evaluations  in  standard  situations  of  how  well  the  roan  is  doing, 
they  are  generally  used  by  researchers  as  criteria. 


Since  almost  oil  Array  enlisted  men  receive  extended  job  training 
beforj  they  are  assigned  to  their  first  job  duty  position,  v‘he  first 
decision  about  new  accessions  is  to  determine  which  job  they  should  be 
trained  tor.  Hence,  the  primary  consideration  in  developing  Array 
selection  and  classification  tests  has  been  to  predict  performance  in 
job  training  rather  than  performance  on  the  job.  Prior  to  training.  Army 
personnel  managers  know  almost  nothing  about  the  capabilities  of  the 
newly  accessioned  men  except  their  aptitude  test  scores,  and  an  effi¬ 
cient  and  effective  means  of  assessing  potential  is  a  must  to  enable 
making  intelligent  decisions.  Once  the  man  is  assigned  to  training,  his 
performance  and  potential  can  be  assessed  over  a  longer  period  of  time, 
typically  eight  weeks,  .arid,  in  most  job  training  programs,  over  a  wide 
sampling  of  tasks.  Evaluation  of  how  well  a  man  is  likely  to  perform  on 
the  job  can  be  made  more  accurately  during  training,  which  is  similar  to 
the  job  environment.  In  the  Army  personnel  system,  then,  selection  and 
classification  tests  are  fulfilling  their  function  if  they  accurately 
predict  success  in  job  training. 


SEv. 


Determining  Effectiveness  of  Tests 

Once  the  tests  have  been  constructed  and  the  criterion  measure  devel 
oped,  the  relationship,  or  correlation,  between  the  test  end  criterion 
measure  is  determined.  The  degree  of  relationship  shows  the  accuracy 
with  which  test  scores  can  predict  criterion  performance,  and  i'c  is 
ordinarily  expressed  as  a  correlation  coefficient.  The  relationship 
between  test  and  criterion  performance  is  called  "validity,"  and  the 
correlation  between  test  and  criterion  is  referred  te  as  the  "validity 
coefficient."  A  validity  coefficient  of  zero  means  no  relationship  ana 
hence  no  prediction.  A  coefficient  of  1.0  means  that  criterion  perfor¬ 
mance  is  perfectly  predictable  from  the  test.  Selection  and  classifica¬ 
tion  tests  used  by  the  Aren-  have  predictive  accuracy  that  falls  between 
these  extremes,  usually  yielding  validity  coefficients  of  .5  and  .6. 

The  most  useful  interpretation  of  a  correlation  coefficient  in  a 
personnel  system  was  developed  by  Hubert  Brogden,^  formerly  chief 
scientist  of  what  is  now  the  Army  Research  Institute.  Interpretation 
cf  a  coefficient  is  in  terms  of  the  results  that  would  obtain  by  using  a 
test  with  no  validity,  at  one  extreme,  and  using  a  perfectly  valid  test, 
at  the  other  extreme.  A  valid  test  improves  the  effectiveness  of 


“'Brogden,  H.  E.  On  the  interpretation  of  the  correlation  coefficient 
as  a  measure  of  predictive  efficiency.  Journal  of  Educational 
Psychology .  57 .  S5~76,  1946. 
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personnel  decisions  above  those  made  on  a  chance  or  random  basis.  Say 
for  example,  that  the  Amy  accessions  1000  enlisted  men  on  a  particular 
day.  If  the  men  are  selected  without  using  any  valid  scores  indicating 
ability  to  perform,  then  selection  would  be  random,  half  the  selected 
men  would  be  afcove  average,  and  the  other  half  below  average. 

With  a  perfectly  valid  test,  the  true  level  of  performance  of  each 
man  can  be  identified  beforehand.  Only  the  50  percent  who  are  known  to 
be  above  average  in  performance,  for  example,  could  be  selected. 

Although  perfect  prediction  is  not  possible  in  actual  practice  and  tests 
with  zero  validity  are  generally  net  employed,  these  extremes  provide 
convenient  anchor  points  for  evaluating  operational  tests,  which  fall 
between  these  two  extremes. 

The  average  performance  of  a  randomly  selected  group  or  of  a  group 
selected  on  the  basis  of  tests  with  zero  validity  is  100  on  the  Army 
standard  score  scale.  Above  average  performance  is  indicated  by  scores 
above  iGQ:  the  higher  the  score,  the  better  the  ^erfonnance ,  Since  an 
operational  test  has  some  validity,  a  group  of  men  selected  by  the  test 
would  be  above  average;  that  is,  their  mean  performance  would  be  greater 
than  100.  To  determine  how  much  greater,  the  other  end  of  the  perfor¬ 
mance  scale  must  be  anchored. 

The  high  end  of  the  performance  scale  is  realized  when  a  perfectly 
valid  predictor  is  used.  Although  no  test  cun  be  perfectly  valid 
because  level  of  performance  cannot  be  perfectly  measured,  there  is 
enough  agreement  about  quality  of  performance  to  make  perfect  measure¬ 
ment  of  performance  a  useful  concept.  Assuming  that  level  of  perfor¬ 
mance  can  be  perfectly  identified,  how  well  the  selected  group  performs 
depends  or.  what  percentage  of  the  population  of  eligible  men  hau  been 
accepted.  If  only  men  above  average  were  desired,  then  men  in  the  top 
50  percent  oi.  the  scores  would  be  selected.  On  the  Army  standard  score 
scale,  this  would  correspond  to  an  aptitude  area  score  of  100  or  better. 
On  the  performance  scale,  ‘he  top  50  percent  would  have  an  average  of 

ll6a'. 

The  maximum  possible  gain  from  selecting  50  percent  of  the  men, 
then,  is  16  points.  If  fewer  men  are  selected,  then  the  gain  would  be 
more.  For  example,  if  only  the  top  50  per  ent  are  selected,  the  maximum 
gain  would  be  2J  points.  If  more  men  are  selected,  the  maximum  gain 
would  be  less.  For  example,  if  only  the  rottom  10  percent  were  excluded, 
the  possible  gain  would  be  4  points,  for  .n  average  of  104. 


The  value  116  was  statistically  derived  by  assuming  ch3t  performance 
is  normally  distributed  with  a  standard  deviation  of  ?0  and  that  the 
bottom  half  of  the  population  v;as  not  selected. 


Since  Army  tests  generally  have  correlation  coefficients  of  about 
.6  with  training  performance }  the  gain  from  using  them  is  60  percent  of 
the  maximum,  tn  the  above  examples ,  if  the  top  50  percent  on  the  test 
are  selected ,  the  average  performance  of  these  selected  men  would  be 
l09«6  (16  points  times  .60  plus  100;,  a  gain  of  9*6  points  over  random 
selection.  If  the  top  30  percent  on  the  test  are  selected  this  group 
would  have  an  average  performance  of  II3.8,  a  gain  of  I5.8  points. 
Finally,  if  only  the  bottom  lQ  percent  are  excluded,  the  gain  would  be 
2.4  points.  With  a  correlation  coefficient  of  .5  between  cast  and  per¬ 
formance,  the  maximum  gain  would  be  cut  in  half.  In  general,  a  correla¬ 
tion  coefficient  of  any  value  can  be  directly  translated  into  a  percen¬ 
tage  of  the  maximum  gain  possible  as  compared  to  random  selection. 

Tne  increased  level  of  oerformonce  can  also  be  translated  into 
percentage  of  selected  men  ?ho  perform  above  the  average  of  the  popula¬ 
tion  of  young  men  eligible  for  military  service.  If  a  test  has  no 
validity,  50  percent  are  above  and  50  percent  below  the  population 
average,  no  matter  what  percentage  is  selected.  If  1000  men  are  examined 
on  a  test  with  no  validity  and  the  top  500  scorers  are  selected,  th'” 

250  of  the  selected  men  wjuld  be  above  and  250  below.  If  l000  men  were 
examined  on  a  perfectly  valid  test  and  the  top  500  were  selected,  then, 
of  course  all  50O  would  be  above  the  population  average.  If  a  test  with 
a  validity  coefficient  of  .6  were  used  to  select  the  500  men  with  the 
highest  scores,  then  35-,  or  70  percent,  would  perform  above  >;he  popula¬ 
tion  average.  The  percantage  of  selected  men  above  the  population 
average  is  a  joint  function  of  the  validity  coefficient  and  the  propor¬ 
tion  of  the  population  excluded;  as  either  the  validity  or  proportion 
excluded,  or  both,  increases,  so  does  the  percentage  whose  performance 
is  above  .average. 

PREDICTION  ACHIEVED  WITH  ARMY  SELECTION  AND  CLASSIFICATION  TF.STS 

The  predictive  accuracy,  or  validity,  of  tescs  is  determined,  as 
indicated  above,  by  the  degrae  of  relationship  between  test  scores 
obtained  prior  to  training  and  performance  measures  obtained  after 
training  or  a  period  or  the  job.  Correlation  between  AFQT  score  and  job 
training  performance  is  about  .3  on  the  average.  For  the  aptitude  area 
scores,  the  corresponding  correlation  coefficients  are  about.  .6  in  rele¬ 
vant  job  training  courses.  These  predictors  have  been  improving  in 
validity  over  the  years  as  improved  tests  have  been  developed  and  incor¬ 
porated  in  the  batteries.  A  new  form  of  the  Army  Classification  Battery 
has  been  developed  that  has  an  average  correlation  coefficient  of  .65 
with  relevant  training  performance.  The  estimated  cost  benefit  to  the 
Army  from  this  increase  in  prediction  is  explained  in  Appendix  C-  The 
cost  benefit  from  operational  selection  and  classification  testing  pro¬ 
grams  in  all  services  is  described  in  Appendix  D. 


Aptitude  measures  have  lower  correlation  with  ratings  of  job  per¬ 
formance,  generally  on  the  order  of  .2  or  , J  for  the  more  valid  tests. 
Correlation  with  job  performance  ratings  Is  lower  than  with  training 
grades  for  a  variety  of  leasons.  Problems  of  sampling  the  tasks  com¬ 
prising  a  job  and  the  different  demands  placed  upon  workers  in  different 
situations  have  already  been  discussed.  Since  job  environments  are  not 
standardized,  differences  in  job  ratings  can  arise  from  sources  other 
than  true  differences  in  job  competence.  Another  reason  for  the  lower 
correlation  is  that  the  overlap  between  the  content  of  tests  and  training 
materials  i*  greater  than  between  test'  and  job  tasks,  Host  selection 
and  classif Nation  test3  are  paper-and  ,-encil  tests,  and  training  grades 
have  a  large  component  os  scores  on  paper-and-pencil  achievement  tests 
given  during  and  at  the  end  of  the  training  course,  whereas  there  is 
little  panor -and -pencil  testing  connected  with  evaluations  of  perfor¬ 
mance  on  the  job.  An  additional  and  perhaps  major  reason  for  the  lower 
correlation  is  that  qualitv  of  work  on  the  job  is  a  function  of  many 
factors  not  present  in  the  training  situation  or  at  least  much  less 
influential.  There  are  no  tests  to  predict  the  effect  of  these  factors 
on  performance  at  the  present  time.  Quality  of  performance  on  the  job 
includes  knowledge  of  specific  tasks  such  as  knowing  hew  to  repair  a 
carburetor,  ar.d  nonintellectual  factors  each  as  m stivation,  attitudes, 
willingness  to  show  up  for  work  and  desire  to  do  a  good  job.  Both 
training  and  testing  place  greater  emphasis  on  knowledge  or  cognitive 
skills  than  do  most  Army  enlisted  jobs.  The  Lowe**  prediction,  then, 
results  in  part  from  limitations  in  assessing  job  performer ce  ar.d  in 
part  from  different  demands  placed  on  the  worker  or  felt  by  him  on  the 
job — demands  rot  measured  by  the  tests. 

Prediction  of  Performance  in  Job  Training  far  Negroes  and  Whites 

A  question  of  considerable  social  importance  concerns  the  effective¬ 
ness  of  Array  selection  and  classification  tests  for  ilegroes  and  whites. 
Tests  have  come  under  severe  criticism  and  legal  attack  in  the  19oG*s  on 
the  grounds  that  they  are  discriminatory  against  Negroes  and  other 
minority  groups.  Questions  about  fairness,  equity,  or  discrimination  in 
personnel  selection,  training,  and  utilization  on  the  job  involve  issues 
that  extend  beyond  scientific  research.  What  scientific  research  can  dc 
is  answer  questions  about  how  we)  l  Negroes  and  whites  perform  on  tests 
and  on  the  criterion  and  about  the  relationship  between  test  scores  and 
later  performance  for  the  two  groups,  findings  from  scientific  research 
can  be  considered  when  making  policy  decisions  about  fairness  in  testing 
Scientific  research,  however,  doe.}  not  by  itself  determine  any  particular 
policy. 


To  answer  the  scientific  question  about  the  effectiveness  of  Army 
tests  for  Negroes  and  whites ,  test  scores  and  job  training  performance 
were  analyzed  for  over  12,000  white  and  T?Q0  Negro  enlisted  men.  The 
men  were  tested  before  their  job  training  began  and  then  followed  up 
during  training  to  determine  how  well  they  did. 

Effectiveness  of  the  tests  was  studied  in  six  job  areas:  combat 
arras  (including  infantry,  armor,  and  combat  engineering,  but  not  ifie?d 
artillery),  electronics  repair,  motor  maintenance,  general  maintenance, 
clerical-administrative,  and  general  technical.  Mean  or  average  level 
of  performance  on  the  relevant  aptitude  tests  and  in  job  training  was 
computed  for  each  racial  group  in  each  job  area.  The  predict 've  accuracy, 
or  validity  coefficient,  was  also  computed  for  each  racial  group.  These 
results  are  statistically  adjusted  to  provide  estimates  of  result;  that 
world  be  obtained  for  random  samples  of  Negroes  and  whites.  The  results 
can  therefore  be  generalized  to  the  population  of  all  young  Negro  and 
white  males  eligible  for  military  service.  The  predictive  accuracy  is 
shown  by  graphs  relating  test  scores  and  training  performance  (Figure  1). 
Table  1  g-'ves  the  data  from  which  the  graphs  were  plotted. 

The  results  for  electronics  repair  are  described  in  some  detail, 
since  they  are  typical  of  what  was  found.  The  lines  relating  aptitude 
scores  and  training  performance  for  Negroes  and  whites  are  essentially 
the  same;  this  similarity  means  that  the  tests  used  to  predict  success 
in  electronics  repair  are  about  equally  effective  for  the  two  groups. 

Since  the  differences  are  small,  the  conclusion  is  that  training  perfor¬ 
mance  for  both  groups  is  at  atn.it  tha  level  expected  from  the  test  scores. 
For  example,  both  Negroes  and  whites  with  scores  of  100,  which  is  the 
population  average,  are  expected  to  show  average  performance  in  training. 
Low-scoring  Negroes  and  whites  are  expected  to  do  less  well  in  training; 
and  the  lower  the  test  score  the  lower  the  expected  performance.  In  terms 
of  predictive  effectiveness  to  identify  men  who  can  succeed  in  training 
for  electronics  repair  jobs,  the  Army  selection  and  classifica :ion  tests 
are  equally  valid  for  Negroes  and  whites.  In  three  other  job  areas, 
motor  maintenance,  clerical-administrative,  and  general  technical,  the 
results  were  similar  to  those  for  the  electronics  repair  area. 

In  the  general  maintenance  area,  the  line  for  whites  is  somewhat 
higher  than  that  for  Negroes.  In  this  area,  Negroes  did  slightly  less 
well  in  training  than  did  whites  with  the  same  test  scores.  For  example, 
Negroes  with  a  General  Maintenance  aptitude  area  score  of  UV  have  a 
predicted  training  performance  level  of  QS,  while  whites  with  the  same 
aptitude  area  score  have  a  predicted  training  performance  level  of  100. 

In  other  words,  there  is  a  slight  tendency  for  the  aptitude  tests  to 
favor  Negroes  as  compared  to  whites  for  purposes  of  predicting  training 
performance  in  general  maintenance  jobs.  Since  the  absolute  differences 
in  level  of  predicted  performance  are  small,  and  since  the  results  for 


precilctivn  accuracy  Jtii 


Predictive  accuracy  of  classification  tests  for  predicting  performance  in  job  training  in  specific  job  areas  for 
Negro  and  White  men. 


the  other  job  areas  do  not  support  the  difference  found  for  general 
maintenance,  no  great  significance  can  be  attached  to  the  results  for 
this  one  job  area. 


In  the  combat  arms  training  courses,  the  line  relating  training 
and  test  performance  for  Negroes  is  slightly  above  that  for  whites.  In 
these  job  training  courses,  then,  Negroes  perform  slightly  better  than 
expected  from  their  test  scores,.  As  for  the  other  job  areas,  the 
differences  are  slight,  and  the  conclusion  remains  that  the  tests  work 
equally  well  for  the  two  groups. 


The  mean  level  of  test  scores  and  training  performance  is  also 
shown  in  Figure  1.  _X.  indicates  the  means  for  whites  and  the  means  for 
Negroes.  Training  performance  and  test  scores  were  set  to  have  means  of 
1GC  for  the  combined  racial  samples.  Means  were  then  computed  separately 
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for  whites  is  to  the  right  and  above  that  for  Negroes.  Whites  score 
higher  both  on  predictor  tests  and  in  training.  The  difference  in  pre¬ 
dictor  means  is  about  3  points  in  the  clerical-administrative  and  combat 
arms  areas  and  extends  up  to  10  or  11  points  in  motor  maintenance  and 
general  technical.  The  differences  in  training  performance  means  are 
smaller:  a  maximum  of  o  points  in  motor  maintenance  and  general  techni¬ 
cal;  3  to  8  points  in  c’sr leal -administrative,  general  maintenance,  and 
electronics  repair;  and  virtually  no  difference  in  combat  arms. 

The  smaller  difference  between  the  groups  in  training  performance 
is  just  what  is  to  be  expected  from  statistical  theory.  Any  group  below 
average  on  ?.  predictor  that  is  less  than  perfectly  valid  will  score 
relatively  higher  on  the  criterion,  but  still  below  average.  The  tech¬ 
nical  label  for  this  shift  in  means  is  called  ’'regression  toward  the 
■  .ear.,"  Since  most  Negroes  score  belcw  the  mean  of  whites  on  the  predic- 
;or,  their  training  performance  is  relatively  better.  Similarly,  the 
training  performance  of  s  group  of  whites  with  the  same  set  of  predictor 
scores  is  better  then  the1:  predicted  performance. 


To  summarize  the  statistical  results,  the  evidence  indicates  that 
in  terms  of  predicting  training  performance,  Army  selection  and  classi¬ 
fication  test  scores  have  the  same  meaning  for  Negroes  and  whites.  To  be 
sure,  Negroes  score  lower  on  the  predictor  tests,  but  their  training 
performance  is  correspondingly  lowr.  These  results  are  consistent  with 
those  found  by  the  Air  Force  where  job  training  was  predicted  by  tests 
about  equally  well  for  Negroes  and  whites  and  with  results  in  many 
civilian  colleges  where  tests  were  about  equally  effective  as  predictors 
of  college  success  for  Negroes  and  whites. 


The  fact  that  the  means  for  Negroes  on  tests  are  well  below  those 
cf  whites  has  led  to  suggestions  that  lower  prerequisites  for  job 
training  or  minimum  qualifying  scores  be  established  for  Negroes.  An 


alternative  way  of  phrasing  this  proposal  is  to  establish  separate  norms 
for  each  racial  group,  and  then  select  men  from  each  group  separately. 

For  example,  Che  top  50  percent  of  each  could  be  eligible  for  a  parti¬ 
cular  job  training  program.  While  this  proposal  would  insure  that  a 
certain  number  of  Negroes  would  be  accepted  for  job  training,  it  would 
by  no  means  insure  chat  many  of  them  would  complete  training  successfully. 
In  fact 3  the  evidence  indicates  that  the  failure  rate  for  Negroes  would 
be  increased  under  the  lowered  prerequisites  proposed.  With  the  present 
prerequisites,  their,  training  mean  performance  is  well  below  average, 
and  to  admit  more  man  with  lower  test  scores  would  only  serve  to  lower 
mean  performance  even  further. 

It  should  be  stressed  that  these  results  were  obtained  in  a  parti¬ 
cular  context.  Training  performance  of  Negroes  could  perhaps  be 
changed  by  changing  other  components  of  the  personnel  system.  Training 
objectives  could  bj  modified  by  fractionating  existing  jobs;  training 
methods  and  materials  could  be  adapted  to  accommodate  lower  level  stu¬ 
dents.  If  these  changes  were  made,  then  more  Negroes  could  be  accepted 
with  a  greater  likelihood  of  success.  The  point  to  be  made  is  that 
modifying  only  the  test  score  prerequisites  for  Negroes  while  leaving 
training  programs  intact  would  serve  to  increase  their  failure  rates. 

Test  scores  are  only  one  component  in  the  total  process  of  selecting, 
training,  and  utilizing  a  man  on  the  job.  An  effective  system  requires 
that  these  three  functions  complement  one  another,  and  if  one  is  adjusted 
the  other  two  must  be  brought  into  line. 

A  final  word  about  equity  or  discrimination  in  testing.  Strictly 
speaking,  discrimination  is  not  a  scientific  issue;  rather,  it  is  a 
legal  and  moral  issue.  The  scientist  can  provide  evidence  on  which 
intelligent  decisions  can  be  based.  The  scientific  evidence  presented 
here  suggests  chat  Che  issue  of  equal  employment  opportunity  is  more 
c css lex  than  merely  adjusting  test  scores  to  obtain  equal  means. 

Othar  Evidence  ef  Jwpertsacs  of  AFQT  Scores 

In  addition  to  predicting  success  in  job  training,  selection  and 
classification  testa  have  been  found  to  be  related  to  other  measures  of 
success  in  life-in  fact,  to  a  variety  of  indexes  of  social  and  job 
status.  For  example,  surveys  of  Army  personnel  consistently  shew  that 
AFQT  score  bears  2  strong  relationship  to  enlisted  grade.  Men  scoring 
higher  on  the  test  get  promoted  faster  and  they  tend  to  reach  higher 
grades.  In  a  survey  of  men  separatees  from  the  Army  after  frmn  ig  to  2s 
months  of  service,  the  percentage  of  men  at  grade  E-5  and  above  for 
different  levels  of  AFQT  seers  and  controlled  by  race  and  level  cf 
education  were  as  follows: 


AFQT 

Non  High  School 

High  School 

Race 

Score 

Graduate 

Graduate 

White 

21  •*  50 

26$ 

34$ 

?1  -  80 

33$ 

50$ 

91  -100 

45$ 

59$ 

Negro 

21  -  50 

21$ 

30$ 

71  -80 

31$ 

40$ 

91  -100 

33$ 

58$ 

The  percentage  of  E-5'e  and  higher  increased  steadily  with  AFQT  score 
for  both  Negroes  and  whites  and  for  both  levels  of  education. 

The  Department  of  Defense  analyzed  the  post-service  earnings  of 
enlisted  men  ten  months  after  separation  from  the  Army.  For  both  Negroes 
and  whites,  mean  income  showed  a  stead/  rise  with  higher  AFQT  score.  For 
example,  for  whites  in  the.  AFQT  score  range  2G-29  assessed 

income  W3S  about  $6500  compared  to  $7400  for  whites  ix.  the  80-89  score 
range.  For  Negroes,  the  comparable  figures  were  $6200  and  $7000.  Unem¬ 
ployment  figures  showed  a  similar  trend.  For  whites  scoring  10  to  50  on 
the  AFQT,  7*3  percent  were  unemployed  ten  months  after  separation  as 
compared  to  5.2  percent  of  m«.n  scoring  93  to  100.  For  Negroes  the 
figures  were  12.0  percent  and  2.8  percent,  respectively. 

The  Army  in  late  I97I  set  new  policy  on  the  granting  of  moral  waivers. 
Under  the  revised  policy,  a  person  with  a  felony  conviction  and  who  dropped 
out  of  high  school  or  scored  below  51  an  the  AFQT,  or  both,  is  not  ordi¬ 
narily  granted  a  moral  waiver.  .Persons  with  a  felony  conviction  who  are 
high  school  graduates  and  score  above  50  on  the  AFQT  can  be  granted  waivers. 
The  decision  to  use  AFQT  score  as  part  of  the  basis  for  granting  moral 
waivers  is  supported  by  years  of  research  and  experience  in  which  the 
consistent  result  is  that  men  scoring  low  on  AFQT  have  a  greater  tendency 
to  become  disciplinary  cases. 


Systems  Analysis  Of  Testing 

Army  selection  and  classification  tests  have  been  shown  through  years 
of  research  and  operational  use  to  fulfill  the»r  intended  purpose--to  ob¬ 
tain  optimal  performance  from  available  manpower  resources.  The  tests  have 
consistently  been  found  to  be  related  to  valid  measures  of  potential  to 
perform,  and  thus  have  enabled  more  intelligent  personnel  decisions. 

Effective  management  of  manpower  resources  always  occurs  in  a  specific 
context  or  system.  Thus,  the  validity  of  the  tests  has  been  established 
for  a  particular  job  training  program,  which  in  turn  was  designed  to 
qualify  workers  for  a  specified  job  structure.  These  three  components  of 
the  personnel  system — selection,  training,  and  utilization  on  the  job-- 
are  critical  dimensions;  as  one  changes,  the  others  are  also  affected. 


Put  another  way,  a  test  has  validity  only  for  certain  criteria  and  not 
for  others.  Selection  and  classification  in  ‘:he  Army  traditionally  have 
been  used  to  identify  men  with  the  greatest  potential  to  succeed  and 
generally  to  reject  men  with  the  least  potential. 

?n  the  late  1960's,  the  Department  of  defense  formulated  policy  on 
human  goals  that  could  have  implications  for  the  way  teste  are  used  in 
the  future.  One  of  the  goals  was  "to  contribute  to  the  improvement  of 
our  society,  including  its  disadvantaged  rrc-mbevs .  by  greater  utilization 
of  our  human  and  physical  resources  while  maintaining  full  effectiveness 
in  the  performance  of  our  primary  mission. This  goal  suggests  that 
personnel  management  in  the  military  servjVefi  must  consider  not  only 
effective  performance,  but  also  the  impact  of  personnel  decisions  on  all 
elements  of  American  society.  In  a  society  where  training  end  promotion 
were  b'sed  on  merit,  tests  served  well.  In  a  new  context  where  greater 
utilization  of  manpower  resources  from  all  walks  of  life  is  a  goal,  back¬ 
ground  characteristics  may  need  to  be  considered  in  making  personnel 
decisions,  and  order  of  merit  as  identified  by  test  scores  may  no  longer 
be  the  controlling  factor. 

An  important  change  in  the  effectiveness  of  tests  has  beer,  found 
when  they  are  used  to  predict  success  in  the  new  type  of  Army  job 
training  called  "performance-referenced  training."  In  the  new  training, 
a  limited  number  of  tasks  are  specified  as  essential  to  a  job.  All  men 
assigned  to  this  type  of  training  art  trained  to  perform  at  a  minimal 
level  of  competence-~the  mastery  level — and  are  *-hen  graduated  as  quali¬ 
fied  workers.  Differences  in  performance  above  mastery  level  are  not 
considered  in  the  evaluation  of  tra:  i?ing  outcomes.  In  contrast  ,  tradi¬ 
tional  job  training  has  placed  a  high  ceiling  on  what  a  man  should  know 
after  training.  In  a  sense,  the  traditional  style  of  instruction  was 
designed  to  develop  in  parh  person  the  maximal  level  cf  skill  and  know¬ 
ledge  commensurate  with  his  aptitudes  rather  than  being  concerned  mainly 
with  acquisition  of  minimal  skills.  Validity  of  the  Army  test  was  estab¬ 
lished  for  the  traditional  type  of  training. 

In  the  new  type  of  training  program,  tests  would  be  expected  to 
have  little  or  no  validity  in  predicting  level  of  performance.  Tests 
can  have  validity  only  if  there  are  differences  in  levels  of  performance. 
Since  differences  in  performance  a*. e  not  assessed  in  performance-re fer=- 
enccd  training,  the  criterion  of  success  is  irrelevant  for  determining 
the  effectiveness  of  aptitude  rest • ,  Thus,  the  fact  that  completion  of 
performance-referenced  training  is  unrelated  to  test  scores  in  no  way 
reflects  on  the  validity  of  the  tests.  Since  there  are  differences  in 
time  required  to  attain  mastery  level  of  performance,  the  tests  remain 
valid  for  predicting  which  men  can  learn  the  material  the  fastest. 

The  new  training  program  c.-uid  also  require  reconsideration  of  how 
men  are  utilized  on  the  job.  As  was  discussed  earlier,  traditional 
training  courses  serve  the  tun* *  ion  of  additional  selection  of  men 
through  extensive  observation  ar.d  evaluation  of  performance  during  the 


period  of  instruction.  If  everyone  completes  training  satisfactorily, 
then  this  additional  selection  would  need  to  be  .performed  subsequently 
on  the  job.  Since  at  present  the  job  environment  has  few  procedures  for 
performing  selection,  procedures  would  need  to  be  developed  to  identify 
men  having  difficulty  in  adjusting  or  coping  with  the  tasks  encountered 
on  the  job  and  to  effect  transfers  to  different  jobs  when  necessary. 

Even  though  tests  would  lose  their  validity  for  predicting  success 
in  performance-referenced  training,  they  would  still  be  valid  for  pre¬ 
dicting  performance  during  the  initial  learning  phases  on  the  job.  If 
the  men  are  taught  skills  that  involve  reasoning  and  memory,  aptitude 
tests  will  predict  both  level  of  performance  and  ease  of  acquiring  the 
skills.  Jobs  differ  iu  the  level  and  complexity  of  demands  imposed  on 
the  incumbents,  and  the  differences  between  individuals  in  ability  to 
perform  will  emerge,  When  che  criterion  situation  allows  differences 
between  individuals  to  surface,  then  the  differences  in  aptitude  test 
scores  will  be  related  to  level  of  performance. 


The  interrelationship  of  job  level,  training  and  mental  ability  .in 
the  current  Army  personnel  system  is  shown  in  Figure  2.  Four  job  levels 
.are  considered — commissioned  officer,  noncommissioned  officer,  skilled 
journeyman,  and  unskilled  worker.  The  commissioned  officer  level  is  the 
most  complex  and  requires  the  highest  level  of  mental  ability,  ordinarily 
in  the  top  third  of  the  population.  Technical  skills  acquired  through 
experience  ordinarily  cannot  be  substituted  for  high  mental  ability, 
especially  at  senior  officer  level.  At  noncommissioned  officer  level, 
most  men  are  in  the  top  half  of  the  population  in  mental  ability.  In 
some  cases,  noncommissioned  officers  with  lower  mental  ability  than  is 
ordinarily  required  have  been  successful.  Extensive  training  or  exper¬ 
ience  can  sometimes  compensate  for  lower  ability,  especially  in  less  tech¬ 
nical  areas,  such  as  combat  arms.  At  the  entry  levels  of  most  Array  enlisted 
jobs,  rigorous  training  can  prepare  most  men  to  perform  adequately.  How¬ 
ever  future  leaders  and  supervisors  in  the  technical  areas  are  ordinarily 
drawn  from  among  men  with  above  average  general  ability. 

The  usefulness  of  tests  for  men  who  are  overage  or  higher  in  ability 
is  generally  well  accepted.  For  example,  selection  test  are  widely  used 
to  pick  college  students  and  cotnaissioned  officers.  There  is  some  concern, 
however,  that  the  tests  are  not  useful  for  men  with  low  ability.  The 
criticism  is  often  based  on  the  fact  that  most  Army  tests  are  of  the  paper- 
and-pencil  variety.  The  concern  is  that  less  able  men  lack  familiarity 
with  the  verbalization  in  the  tests  and  perhaps  are  not  even  accustomed 
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Figure  2.  Relationship  among  job  level,  and  job  training,  and  aptitude  requirements  in  the  Army 
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Experience  and  training  ca: 
be  used  on  substitutes  for 
aptitude,  provided  functi¬ 
onal  literacy  level  is 
attained 


to  using  pencils.  However,  the  statistical  evidence,  presented  earlier 
about  the  validity  02  aptitude  tests  for  Negroes  and  whites  shows  that 
paper-aisd-pencil  tests  are  effective  predictors  in  traditional  job 
training  programs  for  men  of  low  ability.  For  these  men,  there  is 
cone latency  of  performance  between  test  end  training  situations.  Such 
consistency  is  the  mark  of  validity.  Whatever  factors  operate  to  hold 
down  test  performance  for  these  men  also  depress  training  performance. 

In  sunrnary,  Army  tests  have  been  in  use  for  many  years.  Initially, 
there  was  resistance  to  their  operational  use.  Then,  with  demonstrated 
results  in  the  form  of  improved  selection  and  assignment,  a  general 
acceptance  grew  up.  Individual  points  of  criticism  were  made  and  these 
have  at  times  resulted  in  improvements.  In  recent  years,  tests  have 
come  under  increased  criticism  both  in  and  out  of  the  Army.  The  criti¬ 
cisms  have  prompted  the  testing  research  community  to  reexamine  many 
long-stap-din®  belief?  and  practices.  The  likely  outcome  is  that  tests 
will  continue  to  be  needed  and  to  be  used,  but  the  circumstances  of 
their  use  and  in  some  cases  the  tests  thsmselves  will  adjust  to  changing 
concepts  of  their  role. 
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APPENDIX  A 


THE  ARMED  10RCES  QUAUFICATIOR  TEST 

(AIQT) 


InforitigtioR  Pamphlet 


Not  to  be  reproduced  in  cny  form 
without  the  specific  permission  of  the 

COMMANDING  OFFiCER 
U.S.  ARMY  PERSONNEL  RESEARCH  OFFICE 
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ARMED  FORCES  QUALIFICATION  TEST 
INFORMATION  PAMPHLET 


PURPOSE  OF  THIS  PAMPHLET 

This  pamphlet  has  been  prepared  primarily  to  give  general  informa¬ 
tion  about  the  AFQT — the  test  you  will  take  at  the  Armed  Forces  Exam¬ 
ining  Station.  AFQT  is  an  abbreviation  for  Armed  Forces  Qualification 
Test,  which  is  the  mental  ability  test  all  men  have  to  take  before 
going  into  one  of  the  Armed  Services, 

It  is  hoped  that  after  reading  this  pamphlet  you  will  understand 
better  the  purpose  and  make-up  of  the  AFQT  and  thus  be  more  at  ease 
when  you  take  the  te3t. 

WHY  AN  AFQT? 

The  AFQT  has  been  compared  to  a  screen — a  mental  screen.  It  sorts 
men  in  terms  of  their  ability  level  and  throws  back  or  rejects  those 
who  do  not  come  up  to  the  level  required  by  the  Armed  Services.  Why 
should  all  men  be  required  to  take--and  pass--such  a  screen  before  they 
can  be  accepted  for  service?  Isn't  a  physical  examination  enough? 

Can't  anyone  who  is  healthy  be  a  soldier?  The  answer  is  "No."  It  takes 
more  than  physical  ability  to  learn  the  hundreds  of  details  in  basic 
training,  and  in  advanced  individual  training,  and  to  learn  to  do  the 
many  tasks  that  make  up  the  job  of  today's  soldier.  The  ability  to 
learn  these  things  ia  what  the  AFQT  measures. 

WHAT  IS  THE  AFQT? 

The  AFQT  has  been  called  i  test  of  "military  trainability"  for  the 
logical  reason  that  men  who  <  j  well  on  it  can  absorb  military  training 
without  oirficulty  and  learn  to  uo  a  good  job  in  the  Armed  Forces.  A 
man  does  not  have  to  be  a  genius  to  pass  the  AFQf — or  to  do  well  on  it. 

It  is  not  an  "I.Q."  test.  It  is  not  an  "education"  test,  although 
those  who  have  doue  well  in  school  usually  make  good  scores  on  the  AFQT. 
But  there  are  also  men  who  did  not  go  very  far  in  school  who  do  well 
on  the  AFQT. 

DO  YOUR  BEST 

Congress  has  set  a  minimum  score  for  qualifying  on  the  AFQT.  But 
just  barely  passing  won't  do  much  for  a  man  other  than  get  him  a  uniform. 
In  order  to  be  selected  for  training  in  specialist  schools,  or  for 
Officer  Candidate  School,  or  for  promotion  to  the  mote  interesting  and 
better  paying  jobs,  a  man  needs  to  demonstrate  more  ability  than  just 
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being  able  to  pass.  Whether  you  want  to  make  progress  in  the  military 
or  develop  skills  which  will  be  useful  to  you  later  in  civilian  life— 
you  should  do  your  best  and  make  the  highest  score  you  can  on  the  AFQT. 
And  you  will  have  only  one  chance  to  do  this.  Men  are  not  given  the 
AFQT  a  second  time  just  because  they  did  poorly  the  first  time.  So  do 
the  very  best  you  can  on  it. 

TAKING  THE  AFQT 

When  you  go  to  the  Armed  Farces  Examining  Station  to  cake  the  AFQT} 
you  will  be  giver  a  test  booklet  containing  several  practice  questions 
and  100  test  questions.  You  will  also  be  given  a  separate  answer  sheet 
where  you  will  mark  your  answers  to  the  test  questions.  A  special 
pencil  with  an  eraser  will  be  provided  along  with  blank  paper  where  you 
can  do  any  figuring  you  want. 

You  will  receive  complete  instructions  as  to  what  to  do  in  taking 
the  test.  Some  of  these  instructions  are  printed  in  the  test  booklet; 
all  are  read  aloud  by  the  examiner  conducting  the  test.  You  will  be 
told  how  to  mark  your  answers  on  the  answer  sheet  in  the  proper  spaces. 

If  you  make  a  mistake  and  mark  the  wrong  answer  you  can  erase  it  and 
mark  a  different  one. 

After  yon  heve  been  told  what  to  do  to  answer  the  questions f  you 
practice  by  answering  several  sample  questions.  You  are  told  the 
correct  answers  for  these  practice  questions.  Finally ?  you  are  given 
an  opportunity  to  ask  questions  of  the  examiner  if  you  do  not  understand 
what  is  expected  of  you. 

THE  TEST  AND  THE  ANSWER  SHEET 

You  will  find  four  types  of  questions  in  the  AFQT.  There  are  ques¬ 
tions  about  the  meaning  of  words,  questions  about  arithmetic,  questions 
about  tools }  and  questions  about  boxes  made  by  folding  pieces  of  card¬ 
board.  Sample  questions  of  each  of  these  types  are  shown  on  the  follow¬ 
ing  pages.  Therf  arc  25  questions  of  each  type  or  a  total  of  100  ques¬ 
tions  in  the  test.  Fifty  minutes  are  allowed  for  taking  the  test.  Not 
everyone  is  able  to  finish  the  test  in  this  time ,  but  you  can  make  a 
good  score  even  without  finishing  it. 

Each  question  in  the  AFQT  is  followed  bv  four  possible  answers 
labeled  A,  B5  C,  and  D.  Only  one  of  these  i-s  right.  Your  job  is  to 
decide  which  of  the  answers  is  right  and  than  mark  the  answer  sheet  (not 
the  test  booklet)  to  show  yaur  choice. 

There  are  5  main  parts  to  the  answer  sheet.  Or.e  part  provides  space 
for  your  name?  the  date?  and  ccher  such  information.  A  second  part 
contains  a  space  with  heavy  lines  a round  it  labeled  "Practice  Questions.” 


This  is  for  the  answers  to  the  practice  questions  which  everyone  takes 
before  beginning  the  100  questions  that  count.  The  rest  of  the  answer 
sheet--the  main  part--is  numbered  from  1  to  100  to  provide  spaces  for 
the  answer  to  each  of  the  100  test  questions. 

HOW  TO  ANSWER  THE  QUESTIONS 

After  each  question  in  the  test  booklet  are  four  possible  answers 
next  to  the  letters  A,  B,  C,  and  D,  On  the  answer  sheet  after  each 
question  number  are  the  four  letters,  A.  B,  C,  D,  printed  in  tall  thin 
type.  You  blacken  out  one  of  these  letters  on  the  answer  sheet  to 
indicate  your  answer  to  each  question.  Here  is  the  first  Practice 
Question  in  the  test: 

PI.  It  was  a  small  ttble. 

A)  sturdy 

B )  round 

C)  cheap 

D)  little 

The  question  here  is:  Which  of  the  four  words — A)  sturdy,  B) 

round,  C)  cheap,  D)  little — means  the  same  as  the  word  with  the  line 
under  it,  the  word  small? 

The  D  answer,  little,  means  the  same  as  small,  so  on  the  answer 
sheet  in  the  box  labeled  Practice  Questions  and  on  line  number  PI,  you 
would  blacken  out  the  letter  D.  The  answer  sheet  would  then  look  like 
this : 


PRACTICE  QUESTIONS 

PI  S 

8  L 

I 

P2  n 

8  C 

fl 

P3  8 

8  [ 

| 

P4  8 

8  i 

P5  ii 

8  i 

ii 

The  next  practice  questi^n^  P2,  is  about  arithmetic. 

P2.  One  box  of  nails  costs  30^ 

How  much  do  3  boxes  cost? 

A)  35^ 

B)  60flf 

C)  9CM 
d)  99* 

The  right  answer  is  the  C  answer,  90*.  so  on  the  answer  sheet  the 
C  on  line  P2  would  be  blackened  out. 

The  third  practice  question  is  a  picture  question  about  tools. 
Picture  questions  go  across  the  page.  The  question  is  always  about  the 
first  picture  in  the  row.  You  have  to  decide  which  one  of  the  other  4 
pictures  in  the  same  row  goes  best  with  the  first  picture. 


The  first  picture  above  shows  a  valve  used  to  turn  water  on  and  off, 
The  picture  tiat  goes  best  with  the  valve  is  the  B  picture,  the  piece  of 
water  pipe,  so  the  B  on  ihe  P3  line  of  the  answer  sheet  should  be  black¬ 
ened  out. 

There  are  two  main  kinds  of  questions  about  boxes.  In  one  the  job 
is  to  find  the  box  that  can  be  made  by  folding  a  piece  of  cardboard.  In 
the  other  kind  the  question  is  what  the  piece  of  cardboard  would  look 
like  if  the  box  were  unfolded .  The  row  of  pictures  right  below  shows 
what  is  meant  by  folding  a  piece  of  cardboard  to  make  a  box. 


CARDBOARD  BCK 


The  dotted  lines  show  where  folds  are  to  be  made.  The  last  picture 
shows  the  box  that  has  been  made  by  the  folding. 
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Now  look  at  practice  question  P4  below.  The  first  picture  shows 
the  piece  of  cardboard  that  is  to  be  folded.  You  are  to  find  the 
drawing  of  the  box  A,  3,  C,  or  D  that  would  be  made  by  folding  this 
cardboard. 


The  cor7:eet  answer  is  B.  So,  on  the  answer  sheet ,  the  letter  B  on 
the  P4  line  would  be  blackened  out. 


In  the  next  question,  P5,  if  the  box  in  the  first  drawing  were 
unfolded  it  would  look  like  one  of  the  four  cardboard  patterns  shown 
in  the  other  pictures  in  the  row.  Which  one? 


The  correct  answer  is  A. 

If  you  marked  all  the  practice  questions  correctly,  the  Practice 
Box  on  the  answer  sheet  would  look  like  this: 


jpRACTJCE  QUESTIONS 

pi  a  t  c  i 

P2  fi  o  i  o 
P3fi  I  [  S 

P4  e  i  c  o 

p5 1  B  C  0 


A  "SAMPLE"  AFQT 


The  instructions  and  practice  questions  above  are  the  same  as  those 
given  to  all  men  taking  the  AFQT  at  the  Armed  Forces  Examining  Stations. 


The  next  4  pages  contain  a  sample  of  16  questions  like  those  in  the 
AFQT — four  questions  of  each  of  the  four  types  in  the  current  test.  Try 
them  now  for  practice.  When  you  answer  them  use  the  answer  space  in  the 
middle  of  this  page.  You  will  probably  notice  that  the  questions  get 
harder  as  you  go  from  the  first  to  the  fourth  in  each  type.  This  is  the 
way  the  AFQT  is  arranged — easy  questions  first  then  harder  as  you  go 
alciig.  About  nine  out  of  ten  met  get  the  first  question  right.  About 
one  out  of  three  of  those  taking  the  AFQT  are  able  to  get  the  last  one 
right.  How  many  can  you  answer  correctly! 

Mark  your  answers  here: 


The  correct  answers  to  all  of  the  16  questions — just  in  case  you 
want  to  check — are  on  the  last  page  of  this  pamphlet. 

SAMPLE  AFQT  QUESTIONS 

1.  The  boys  discovered  the  cave. 

A)  searched 

B)  found 

C)  enlarged 

D)  entered 

2.  There  will  be  variable  winds. 

A)  shifting 

B)  chilling 

C)  steady 

D)  mild 

3.  He  is  a  man  of  great  vigor. 

A)  wickedness 

B)  strength 

C)  reputation 

D)  wisdom 
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proved  correct. 


4.  The  prognostics t ion 

A)  solution 

B )  measurement 

C)  identification 

C)  prediction 

5.  Bob  wants  to  buy  a  wagon.  He  has  5  dollars  and  needs  5  dollars 
more.  How  much  does  the  wagon  cost? 

A)  $10 

B)  $15 

c)  $25 

D)  $55 

6.  If  12  men  are  needed  to  run  4  machines  how  many  men  are  needed  to 
run  20  machines? 

A)  24 

B)  48 

C)  60 

D)  80 

7.  During  one  year  the  fruit  crop  was  500  bushels.  The  next  year  the 
crop  increased  102  percent.  How  many  bushels  were  produced  in  both 
years? 

A)  510 

B)  602 

C)  10 10 

D)  1510 

8.  If  John  can  stack  2/5  of  a  cord  of  wood  in  1  hour  how  long  will  it 
take  him  to  stack  4  cords  of  wood? 

A)  6  hrs 

B)  5  hrs 

C)  4  hrs  20  mins 

D)  2  hrs  40  mins 


(The  correct  answer  to  each  question  has  been  blackened  out) 
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A  FINAL  WORD 

If  you  have  read  each  page  of  this  pamphlet  and  have  answered  the 
sample  questions— -or  as  many  of  them  as  you  could-*>yo!i  should  be  well 
prepared  for  taking  the  AFQT  at  the  Armed  Forces  Examining  Station.  You 
may  not  be  able  to  answer  every  question  correct ly—very  few  people 
are — but  you  will  be  able  to  understand  what  is  expected  of  you.  And 
you  should  feel  at  esse  because  you  will  be  taking  a  test  that  you  are 
familiar  with — one  on  which  you  will  be  able  to  make  a  score  that  will 
do  full  credit  to  your  ability. 
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APPENDIX  3  ACB  AND  AQB  SAMPLE  QUESTIONS 


ACB  and  AQB  SAMPLE  QUESTIONS 

The  following  pages  contain  sample  questions  for  tests  composing 
the  Army  Classification  Battery  (ACB)  and  the  Army  Qualification  Battery 
(AQB).  Included  within  the  AQB  are  four  tests  which  compose  the  Armed 
Forces  Qualification  Test  (AFQT).  The  four  AQB  tests  used  for  the  AFQT 
score  are  Verbal  (VE),  Arithmetic  Reasoning  (A R),  Shop  Mechanics  (SM), 
and  Pattern  Analysis  (PA). 

The  AFQT  is  the  basic  screening  instalment  used  for  both  enlistment 
and  induction.  The  AQB  is  used  a°  «*n  additional  screening  instrument 
for  some  inductees  and  as  a  qualifying  instrument,  for  men  who  want  to 
enlist  with  the  assurance  of  training  in  a  particular  occupational  area 
or  initial  assignment  to  a  particular  overseas  command.  The  ACB  is  a 
set  of  tests  paralleling  those  of  the  AQB,  but  with  more  questions  that 
discriminate  among  the  higher  level  men.  The  ACB  is  used  to  identify 
the  areas  in  which  a  man  is  better  suited  so  thaL  he  may  be  assigned  to 
a  job  where  he  is  more  likely  to  succeed. 

The  sample  questions  are  representative  of  items  in  the  AQB  and 
ACB.  The  batteries  represented  and  the  number  of  items  in  each  test  are 
indicated  at  the  top  of  each  page.  Organisation  of  the  sample  questions 
by  test  and  battery  is  shown  below. 


Test 

Battery 

VE 

ACB, AQB, AFQT 

AR 

ACB, AQB .AFQT 

SM 

ACB 

SM 

AQB, AFQT 

PA 

ACB 

PA 

AQB, AFQT 

MA 

ACB, AQB 

Test 

Battery 

M 

ACB, AQB 

EI.I 

ACB 

EI.I 

AQ3 

Cl 

ACB ,AQB 

GIT 

ACB .AQB 

ACS 

ACB,AOB 

ARC 

ACB 
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VERBAL  TEST  (VE) 
SAMPLE  QUESTIONS 


ACB:  50  items  AFQT  and  AQB:  25  items 


Each  item  requires  the  examinee  to  select  the  correct  synonym  for 
the  underlined  word  in  a  short  sentence. 


1.  The  boys  discovered  the  cave. 

A)  searched 

B)  found 

C)  enlarged 

D)  entered 


2.  There  will  be  variable  winds. 

A)  shifting 

B)  chilling 

C)  steady 

D)  mild 


3.  He  is  a  man  of  great  vigor. 

A)  wickedness 

B)  strength 

C)  reputation 

D)  wisdom 


4.  The  prognos  t ica  t ion  proved 
correct. 

A)  solution 

B)  measurement 

C)  identification 

D)  prediction 
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ARITHMETIC  REASONING  TEST  (AR'j 
SAMPLE  QUESTIONS 


ACB:  40  items 


AF()T  and  AQB:  25  items 


Each  item  ia  a  reasoning  problem.  involving  application  of  arithmetic 


processes . 


1.  Bob  wants  to  buy  a  wagon. 
He  has  5  dollars  and  needs 
5  dollars  more.  How  much 
does  the  wagon  cost? 


A)  $10 

B)  $15 

C)  $25 

D)  $55 


If  12  men  are  needed  to  run 
4  machines,  how  many  men  are 
needed  to  run  20  machines? 

A)  24 

B)  48 

C)  €0 

D)  80 


During  one  year,  the  fruit 
crop  was  500  bushels.  The 
next  year,  the  crop  increased 
102  percent.  How  many 
bushels  were  produced  in 
both  years? 

A)  510 

B)  502 

C)  1C  10 

D)  1510 


If  John  can  stacx  2/5  of  a 
cord  of  wood  in  1  hour,  how 
long  will  it  take  him  to 
stack  4  cords  of  wood? 

A)  S  hrs 

B)  5  hrs 

C)  4  hrs  20  rains 

D)  2  hrs  40  mins 


SHOP  MECHANICS  TEST  (SM):  40  items 

SAMPLE  QPESTIONS 


Questions  are  directed  toward  drawings  of  tools  or  illustrations  of 
son leal  principles  involved  in  shop  work. 


The  tool  normally  used  to  pry 
boards  loose  and  to  pull  nails 
from  construction  forms  is 


SE 


3.  The  tool  ordinarily  used  by 
a  draftsmn  to  transfer 
measurements  from  one  drawing 
to  another  is 


The  wrench  combination  used  to 
tighten  the  pipe  connection  is 


4.  The  tool  ordinarily  used 

in  a  miter  box  to  cut  angles  is 


SHOP  MECHANICS  TEST  (SM):  25  items 
SAMPLE  QUESTIONS 


Each  item  involves  pictures  of  tools  or  other  implements  used  in 
the  shopvork  of  various  trades.  A  question  picture  on  the  left  is  to  be 
matched  with  one  of  four  answer  pictures  to  its  right  on  the  basis  of 
the  one  with  which  it  goes  best. 


PATTERN  AKALVSIS  TEST  ( M } :  21?  v.vbA 
samite  question? 

Each  item  is  comprised  cf  a  question  pictuT?-  on  the  left  and  four 
answer  pictures  to  tjte  right.  The  question  picture  it  either  a  two 
dimensional  pattern  that  could  be  folded  into  one  of  the  four  figures 
shown  to  the  right,  or  it  is  a  three  dimern> icnel  figure  that  i.ould  be 
formed  by  folding  one  at  four  patterns  shown  t.o  its  right.  Some  problems 
involve  basic  figures,  such  as  cubes,  with  varying  degrees  of  difficult 
surface  designs  and  pattern  layouts.  Other  problems  involve  increasingly 
difficult  odd-shaped  figures  and  patterns. 


b 

.  :  L.zr_ 


UAL 


MECHANICAL  APTITUDE  TEST'  (MA) 
SAMPLE  QUESTIONS 


ACB:  45  items 


AQB:  20  items 


Questions  are  asked  about  illustrations;  of  mechanical  problem 
situations  that  call  for  an  understanding  o'!  physical  principles. 


1.  Which  upright  supports  the 
greater  part  of  the  load? 


2.  Wh’ch  wheelbarrow  is  HARDER 
t;  lift? 


3.  Which  chain  is  helping  to 
hold  up  the  sign? 


B)  B 

C)  Both 

D)  Neither 

4.  The  gears  of  this  rotary  oil 
pump  fit  very  closely  to  the 
sides  and  end  of  the  case  and 
against  each  other.  If  the 
gears  turn  in  the  ’irection 


130  Lbs. 


150  Lbs. 


AUTOMOTIVE  INFORMATION  TEST  (Al) 
SAMPLE  QUESTIONS 


ACB :  40  items 


AQB:  20  items 


Items  involve  the  identification  and  operation  of  automobile  parts 
through  descriptions  and  through  pictures  or  diagrams. 


1.  Cross  members  are  part  of  the 

A)  shock  absorbers 

B )  frame 

C )  hanger 

D)  auxiliary  spring 


2.  In  the  illustration  below, 
which  part  is  closest  to  the 
valves? 

A)  1 

B)  2 

C)  3 

D)  4 


3.  Jerky  spplicacion  of  power 
to  the  crankshaft  is 
eliminated  by  the 

A)  crankcase 

B)  main  springs 

C)  shock  absorbers 

D)  flywheel 


4.  The  function  of  the  rotor 

is  to 

A)  open  and  close  the 
distributor  points 

B)  rotate  the  distributor 
cam 

C)  distribute  electricity 
to  the  spark  plugs 

D)  rotate  the  distributer 
shaft 


ELECTRONICS  INFORMATION  TEST  (ELI):  40  item" 
SAMPLE  QUESTIONS 


This  test  contains  an  equal  number  of  both  verbal  items  and  picture 
items  calling  for  a  knowledge  of  electronic  principles.  The  picture  items 
require  the  examinee  to  associate  electronically  a  pictured  cbjcc^  with 
one  of  four  other  pictured  objects. 


f-i  L 


-•1=^ 


3-  An 

electric  switch  is  used 

4.  The  device  used  to  conduct 
current  from  armature  coils 

A) 

only  to  break  a  circuit 

to  the  brushes  of  an  a.c. 

B) 

only  to  complete  a  circuit 

generator  is  a 

c) 

either  to  break  or  to 

complete  a  circuit 

A)  catchall 

D) 

to  change  the  direction  of 

B)  rotor 

flow  in  a  circuit 

C)  stator 

D)  slip  ring 

CLASSIFICATION  INVENTORY  (Cl) 
SAMPLE  QUESTIONS 


ACB:  12p  items  AQB:  75  items 


The  test  consists  of  self-description  itetas  for  which  the  examinee 
is  to  indicate  the  choice  that  most  closely  reflects  his  personal  back¬ 
ground,  experiences ,  attitudes,  and  opinions  of  himself.  The  items  were 
empirically  selected  to  predict  combat  effectiveness. 


1.  The  job  of  explorer.  "*>. 

A)  I  think  I  would 
like  the  job. 

B)  I  think  I  would 
dislike  the  job. 


2.  Which  would  you  like  better? 

A)  going  to  see  a  mystery 
movie 

B)  going  to  a  boxing 
match 


I  have  never  done  anything 
dangerous  for  the  thrill  of  it. 

A)  Yes,  this  is  true  of  me. 

B)  No,  this  is  not  true  of  me. 


4.  Which  describes  you  the  LEAST? 

A)  liberal  with  money 
3)  look  on  all  sides  of  a 
question 

C)  religiously  exact 

D)  respectful 

E)  smooth  in  movements 
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GENERAL  INFORMATION  TEST  (GIT) 
SAMPLE  QUESTIONS 


ACB:  50  items 


AQB:  30  items 


This  is  an  objective  test  of  knowledge  in  a  variety  of  areas  such  as 
the  Amy,  sports,  guns,  and  automobiles. 


JL. 


The  stripes  on  a  soldier's 
sleeve  that  show  hew  long 
he  has  been  in  the  service 
are  called 

A)  crimp  checks 

B )  hash  marks 

C)  rockers 

D)  hitch  lines 


5- 


2. 


The  "No.  00  Buck"  is 

correctly  used  in  a 

A)  .30  •  06  rifle  against 
a  moose  at  long  range 

B)  16  -  gauge  shotgun 
against  pheasants 

C)  20  -  inch  barrel  carbine 
against  a  deer  in  dense 
cover 

D)  12  -  gauge  shotgun 
againct  a  nearby  bear 


4. 


A  regulation  baseball 
diamond  is  a  90-foot 
square;  a  softball 
diamond  is  a 

A)  60  -  foot  square 

B)  75  -  foot  square 

C)  90  -  foot  square 

D)  120  -  foot  square 


The  intake  and  exhaust 
valve  stems  in  an 
automobile  engine  are 
driven  by  the 

A)  transmission 

B)  crankshaft 

C)  camshaft 

D)  drive  shaft 
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ARMY  CLERICAL  SPEED  TEST  (ACS) 


ACB:  110  items 


AQB:  110  items 


Number  Reversal:  BO  items 

The  examinee  is  to  blacken  the  space  under: 

"R"  (for  RIGHT)  if  the  numbers  are  exactly  reversed. 
or  ,:W"  (for  WRONG)  if  the  numbers  are  not  exactly  reversed. 


R  ’  W 


1. 

0874  ::: 

”"4730 

it 

W 

2. 

0159560 

0659510 

Coding:  50  items 


The  examinee  is  to  find  the  number  that  goes  with  the  question  word 
by  looking  for  the  word  and  number  pair  in  the  key,  then  mark  his  answer 
for  each  question.  Question  3  is  already  marked. 


KEY: 


auto  .  .  .  2715  house . 3451 

bread  . 1413  train . 2864 


3- 

train 

1413 

2715 

2654 

3451 

4. 

bread 

1413 

2715 

2C$4 

3451 

j  • 

auto 

1413 

2715 

2864 

3451 

6* 

train 

1413 

2715 

2664 

3451 

house 

1413 

2715 

2864 

3451 

7- 
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APPENDIX  C  VALUE  OF  IMPROVED  EFFECTIVENESS 

OF  CLASSIFICATION  TESTS 


A  new  classification  battery  has  been  developed  that,  compared  to 
the  old  battery,  improves  the  accuracy  with  which  potential  for  training 
performance  is  assessed.  The  old  battery  yielded  a  gain  of  about  55 
percent  of  the  maximum  possible  gain  in  performance  that  could  be  realized 
with  perfectly  accurate  measurement.  (Maximum  possible  gain  is  explained 
in  the  main  body  of  the  present  report,  pages  5  through  7.)  The  new 
classification  battery  yields  a  gain  of  65  percent  of  the  maximum,  10 
percent  higher  than  the  old. 

The  increase  can  be  translated  to  dollar  value  by  determining  to 
what  extent  training  performance  would  be  enhanced  by  using  the  new 
battery  to  make  initial  assignments  to  job  training.  This  determination 
was  accomplished  through  simulation  studies  conducted  on  a  computer. 

Test  scores  on  both  the  new  and  the  old  classification  batteries  were 
generated  by  the  computer;  the  scores  were  representative  of  recent 
Amy  input.  The  men  simulated  were  assigned  to  job  areas  in  accordance 
with  Array  quotas  for  the  different  jobs.  Each  man  was  assigned  by  the 
computer  in  two  ways — once  on  the  basis  of  scores  on  the  old  battery  and 
once  on  the  basis  of  scores  on  the  new  battery.  For  each  man,  training 
performance  was  estimated  in  both  assignments,  and  the  expected  perfor¬ 
mance  summed  for  all  men.  A  representative  group  of  men  assigned  to 
jobs  on  a  random  basis  (that  is,  with  no  knowledge  of  potential)  are  said 
to  have  an  average  expected  performance  score  of  100.  A  representative 
group  assigned  on  the  basis  of  the  old  classification  battery  produced  an 
average  expected  performance  level  of  102.5-  When  assigned  on  the  basis 
of  the  new  classification  battery,  the  average  performance  level  was 
104.5-  The  gain  of  two  points,  from  102.5  to  104.5,  can  be  translated  to 
dollars  by  considering  the  cost  and  value  of  job  training. 

According  to  recent  Army  figures,  the  median  cost  of  providing  job 
training  to  produce  a  qualified  worker  in  the  field  is  $4,000-  (While 
bhere  is  an  additional  cost  of  about  $2,000  to  procure  a  man  and  put  him 
through  basic  combat  training,  this  cost  is  not  used  in  computing  the 
value  of  job  training  performance.)  Individual  jobs  vary  substantially 
from  the  median  of  $4,000,  but  the  figure  is  a  recjonable  average  for  the 
cost  of  job  training.  Since  the  average  worker  costs  $4,000  to  train, 
in  that  sense  he  is  also  worth  $4,000  to  the  Army  upon  completion  of  job 
training.  The  below  average  man  is  expected  to  produce  less  than  the 
cost  of  training,  and  the  above  average  man  is  expected  to  produce  more. 
Following  this  argument,  anything  that  can  be  done  to  enhance  the  per¬ 
formance  of  the  workers  will  increase  the  return  to  the  Array  of  its 
investment  in  job  training. 


Preceding  page  blank 
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An  unsatisfactory  man  is  defined  as  one  who  yields  nc  return  to  the 
Army  from  the  investment  in  training;  the  $4^000  is  lost.  Hie  person 
whose  expected  training  performance  is  only  oO,  where  the  average  is  100, 
is  unsatisfactory.  being  these  definitions  as  a  basis  for  calculation, 
only  13  percent  of  the  population  of  young  men  liable  for  service  would 
be  unsatisfactory  in  the  typical  Army  job;  the  13  percent  includes  the 
10  percent  which  by  law  is  mentally  unqualified  for  service.  As  the 
scale  of  expected  performance  is  ascended,  the  return  to  the  Army 
increases,  until  at  the  average  level  of  100,  training  cost  and  value  of 
performance  are  balanced.  Assuming  a  linear  increase,  each  point  increase 
between  80  and  100  is  worth  $200.  ($4000  divided  by  the  20  points 

between  80  and  100.) 

The  scale  also  extends  to  the  positive  side.  Each  point  of  expected 
performance  above  the  average  is  also  worth  $200.  Since  the  new  classi¬ 
fication  battery  would  increase  the  level  of  expected  performance  by  two 
points,  the  value  of  the  gain  is  $400  per  nan.  With  an  annual  input  of 
200,000  men  the  Army  would  realize  an  additional  $80,000,000  ($400  times 
200,000  men)  worth  of  increased  performance  by  using  the  new  battery  to 
make  training  assignments. 

The  $80,000,000  is  a  gain  that  could  be  realized  because  of  the 
increased  effectiveness  of  the  classification  tests.  This  analysis  does 
not  imply  that  the  increased  performance  would  result  in  an  immediate 
corresponding  reduction  in  the  Army  budget.  It  does  mean,  however,  that 
for  a  fixed  number  of  enlisted  men  the  overall  quality  of  performance 
will  be  higher.  As  the  Army  force  is  reduced,  each  position  becomes 
more  important  and  the  quality  of  each  individual's  performance  more 
critical  to  the  accomplishment  of  the  Army  mission. 


APPENDIX  D  SELECTION  AND  CLASSIFICATION  PROCEDURES 


ENLISTED  SELECTION  AND  CLASSIFICATION^ 

Selection  and  classification  testing  in  the  Armed  Forces  had  its 
origin  in  World  War  I  when  the  Army  Alpha  ^pc^Beta  tests  were  used. 

During  World  War  II,  over  15,000,000  men  were  tested  with  the  Army 
General  Classification  Test. 

The  Armed  Forces  Qualification  Test  (AFQT)  was  developed  and  intro¬ 
duced  in  1950  as  a  screen  for  determining  mental  qualification  for  the 
armed  services.  Research  has  produced  improved  forms  from  time  to  time. 
During  the  20-plus  years  that  the  AFQT  has  been  used,  the  number  of 
potential  faili  res  identified  by  the  AFQT  and  kept  out  of  the  services 
is  estimated  at  over  1,000,000.  In  recent  years,  the  cost  of  acces¬ 
sioning  an  enlisted  man  and  putting  him  through  initial  job  training  Is 
estimated  at  about  $6,000.  Over  55,000  potential  failures  in  the 
services  were  identified  by  the  AFQT  in  calendar  year  I970  3nd  kept  out 
of  active  duty.  Thus,  use  of  the  AFQT  to  identify  these  potential 
failures  in  recent  years  has  avoided  annual  accessioning  and  training 
costs  estimated  at  $330  million. 

Classification  tests  are  given  to  enlisted  men  to  determine  Lhe  job 
areas  in  which  the  men  are  best  qualified.  The  payoff  from  classification 
testing  is  improved  performance  because  the  men  can  be  assigned  tc  jobs 
where  their  aptitudes  or  potential  to  succeed  are  highest.  Over  the 
20-plus  years  that  classification  tests  have  been  used.  sr.  es  time  tad 
450,000  man-years  of  improved  productivity  has  been  obtained. 

In  recent  years,  the  value  of  a  man-year  of  productivity  is  about 
$5,000.  In  one  year,  the  average  estimated  gain  in  productivity  is 
22,500  man-years;  the  annual  dollar  worth  of  classification  testing  is 
thus  about  $112  million. 

The  total,  annual  gain  in  recent  years  from  enlisted  selection  and 
classification  is  estimated  at  $442  million.  Considering  the  magnitude 
of  Department  of  Defense  operations,  the  estimated  gain  of  $442  million 
is  probably  an  underestimate  of  the  savings  from  selection  and  classi¬ 
fication  techniques.  The  alternatives  to  tests  are  random  assignment 
or  decisions  based  on  education,  work  history,  and  subjective  impressions 
regarding  individuals .  Tests  are  more  likely  to  provide  valid  assess¬ 
ments  of  current  potential  to  perform  and  are  free  of  personal  bias  and 
subjective  judgment. 


This  statement  was  prepared  by  Dr  J.  W.  Bowles,  US  Air  Force,  Dr.  M.  F. 
Wiskoff,  US  Navy,  and  Dr.  M.  H.  Maier,  US  Army. 


OFFICER  SELECTION  &  RETENTION 


The  maintenance  of  a  well-trained,  effectively  performing  officer* 
cadre  is  very  expensive.  For  example,  the  cost  to  obtaining  one  commis¬ 
sioned  officer  through  the  service  academies  is  $50,000.  Once  an  officer 
is  commissioned,  there  is  considerable  additional  investment  in  pro¬ 
fessional  training — an  investment  that  is  lost  if  the  officer  leaves  the 
service.  Personnel  research  over  the  past  20  years  has  greatly  improved 
the  capability  to  select  and  retain  officer  personnel. 


Selection  tests  used  on  applicants  for  the  service  academies  have 
identified  an  estimated  1200  potential  failures  per  year.  Screening 
tests  are  also  used  with  applicants  for  Reserve  Officers  Training  Course 
(R0TC)  and  Officers  Candidate  School  (OCS).  The  annual  savings  in 
trailing  costs  realized  through  the  use  of  these  screening  procedures  is 
estimated  at  $c0  million. 


New  techniques  have  been  developed  through  research  to  enhance  the 
overall  performance  of  commissioned  officers.  Tests  have  been  developed 
to  identify  R0TC  applicants  most  likely  to  complete  officer  training  and 
remaiu  on  active  duty  after  completing  their  minimum  requirements,  and 
classification  techniques  are  available  to  identify  officers  with  greater 
potential  in  combat  leadership  or  technical-managerial  leadership.  On 
this  basis,  officers  can  be  assigned  to  the  areas  where  their  potential 
is  high.  The  increased  retention  and  utilization  of  commissioned 
officers  resulti^s  from  use  of  selection  tests  and  classification  pro- 
ueuuifts  me  estimated  i.o  be  Wurth  $6.4  BiilllOIi  per  year. 


PILOT  SELECTION 

Trends  dating  back  to  World  War  II  have  shown  that,  without  the  use 
of  screening  tests  for  aircraft  pilots,  rates  of  elimination  from  train¬ 
ing  average  50$  or  more  of  the  input.  Since  the  number  of  pilots 
required  during  World  War  II  could  not  have  been  trained  with  such  high 
elimination  rates,  the  services  undertook  research  tr*  develop  effective 
selection  procedures  for  pilots.  The  effort  was  successful  to  the 
extent  that  elimination  rates  from  primary  pilot  training  during  World 
War  II  were  cut  in  half.  Based  on  cost  figures  of  the  time,  the  World 
War  II  pilot  selection  program  saved  approximately  $190  million  in 
training  costs. 

Tests  to  select  pilots  for  fixed  wing  aircraft  and  helicopters 
continue  to  be  used  by  the  services.  Recent  cost  analyses  (taking  into 
account  average  time  to  elimination  from  training)  show  that  each  pilot 
training  failure  costs  $16,000.  The  use  of  selection  tests  for  pilots 
has  resulted  in  an  estimated  savings  of  $52  million  per  year  in  training 
costs. 


